Fault Tolerance in Cloud Using Reactive and Proactive Techniques
نویسنده
چکیده
Fault tolerance plays a vital role in ensuring high serviceability and reliability in cloud. A lot of research is currently under way to analyze how cloud can provide fault tolerance for an application. The work proposes a reactive fault tolerant technique that uses check pointing to tolerate the fault. The work proposes a VMμ Checkpoint framework to protect both VMs and applications in the VMs against transient errors. The VM-μCheckpoint mechanism is implemented using CoW-PC (Copy on Write – Presave in cache) algorithm. The CoW-PC algorithm presaves all the tasks running on the VM’s in a cache memory. When there is any transient failure happening in VMs, it is noted and it is recovered using last presaved checkpoint from the cache memory. Once the tasks are executed successfully, the presaved checkpoints are deleted automatically from the cache memory. Thus the algorithm uses in place and in memory recovery of checkpoints that reduces the checkpoint overhead and improves the performance.
منابع مشابه
A Survey on Fault Tolerance Techniques in Cloud Computing
Cloud computing becomes very popular these days and fault tolerance is a major issue in this cloud environment. This paper is dedicated to study different kinds of faults and various techniques for handling them. This survey mainly focuses on why fault tolerance is required in cloud computing and which techniques are used to handle these failures so that services can be delivered without delay ...
متن کاملImproving the palbimm scheduling algorithm for fault tolerance in cloud computing
Cloud computing is the latest technology that involves distributed computation over the Internet. It meets the needs of users through sharing resources and using virtual technology. The workflow user applications refer to a set of tasks to be processed within the cloud environment. Scheduling algorithms have a lot to do with the efficiency of cloud computing environments through selection of su...
متن کاملUsing Proactive Fault-Tolerance Approach to Enhance Cloud Service Reliability
The large-scale utilization of cloud computing services for hosting industrial/enterprise applications has led to the emergence of cloud service reliability as an important issue for both cloud service providers and users. To enhance cloud service reliability, two types of fault tolerance schemes, reactive and proactive, have been proposed. Existing schemes rarely consider the problem of coordi...
متن کاملA Replica Distribution Based Fault Tolerance Management For Cloud Computing
Cloud computing now a day's become most popular and reliable computing technique for organizations and individuals. In the cloud environments, data availability and backup replication are critical and complex issues in the an efficient fault tolerance policy is the major. Fault tolerance policy is the strategy in action when a fault occurs in the system. Taking backups is the one of the most us...
متن کاملSecond workshop on proactive failure avoidance, recovery, and maintenance (PFARM)
Proactive approaches to failure avoidance, recovery and maintenance have recently attracted increased interest among researchers and practitioners from various areas of dependable system design and operation. This first workshop provided a stimulating, and fruitful forum to foster collaboration among researchers working on proactive fault management, to discuss ideas, exchange experiences and t...
متن کامل